• HN Mail
  • Subscribe

REINFORCEMENT LEARNING

Show HN: TextPolicy – reinforcement learning for text generation on a MacBook
4 points | 0 comments

Scaling Reinforcement Learning: Environments, Reward Hacking, Agents, Data
2 points | 0 comments

AI and Games
2 points | 0 comments

How TRM Labs Scaled Security with Self-Improving AI Vulnerability Agents
13 points | 0 comments